Many Body physics and the capacity of quantum channels with memory. 
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In most studies of the capacity of quantum channels, it is assumed that the errors in each use of 
the channel are independent. However, recent work has begun to investigate the effects of memory 
or correlations in the error, and has led to suggestions that there can be interesting non-analytic 
behaviour in the capacity of such channels. In a previous paper we pursued this issue by connecting 
the study of channel capacities under correlated error to the study of critical behaviour in many- 
body physics. This connection enables the use of techniques from many-body physics to either 
completely solve or understand qualitatively a number of interesting models of correlated error with 
analogous behaviour to associated many-body systems. However, in order for this approach to work 
rigorously, there are a number of technical properties that need to be established for the lattice 
systems being considered. In this article we discuss these properties in detail, and establish them 
for some classes of many-body system. 
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I. INTRODUCTION 

One of the most important problems of quantum infor- 
mation theory is to try to determine the channel capacity 
of noisy quantum channels. In a typical scenario, Alice 
would like to send Bob information over many uses of 
a noisy quantum communication link. As the channel 
is noisy, this cannot usually be done perfectly, and so 
they must use some form of block encoding to combat 
errors. The channel capacity is defined as the optimal 
rate at which information may be transferred with van- 
ishing error in the limit of a large number of channel 
uses. There are a variety of different capacities, depend- 
ing upon whether Alice and Bob are interested in trans- 
mitting classical or quantum information, and whether 
they have extra resources such as prior entanglement. In 
this paper we will be concerned mostly with the capacity 
for sending quantum information, and so whenever we 
write the term 'channel capacity' we will implicitly be 
referring to the quantum channel capacity. 

In most work on these problems, it has usually been 
assumed that the noisy channel acts independently and 
identically for each channel use. In this situation the 
transformation corresponding to n-uses of the channel 
may be written as an ri-fold tensor product of the single- 
use channel £i: 



En = £\®£\' 



(1) 



However, in real physical situations there may be cor- 
relations in the noise that acts between successive uses, 
an interesting example being the decoherence of photons 
optical fibres under the action of varying birefringence, 
which can be correlated due to mechanical motion or slow 
temperature fluctuations In such situations one can- 
not describe the action of the channel in a simple tensor 
product form: 



In this setting one must really describe the action of the 
channel by a family of quantum operations corresponding 
to each number of uses of the channel n = 1, 2, ...oo: 



Tlj 



(3) 



We will call any such family of operations a memory 
channel or a correlated channel [soj . Defining the notion 
of channel capacity for such a correlated channel is not 
always straightforward. In principle a family of channels 
such as ([3]) may not have any sensible limiting behaviour 
as n — > oo 0]. However, in this paper wc will not need to 
discuss this issue in detail, as we will only consider fairly 
regular channels that have a (unique) well-defined notion 
of channel capacity. 

In the case of uncorrelated errors, it has recently been 
shown [lo| that the quantum channel capacity of an un- 
correlated quantum channel is given by: 



Q{£) 



lim 

n— !-oo 



(4) 



where /(^) is the so-called coherent information of the 
quantum channel ^: 

m := sup s[^[p)) - s[i ® m) m) (5) 

p 

where S denotes the von-Ncumann entropy, p is a state, 
and is a purification of p. 

Given that equation is the quantum channel capac- 
ity for memoryless channels, it is natural to hope that the 
corresponding expression: 



lim 



(6) 



(2) 



will represent the quantum channel capacity in the case 
of correlated errors. However, this will not always be the 
case, not least because this limit does not always exist 
[6|. However, in this paper we will not only assume 
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that this hmit exists, we will also initially work under the 
assumption that it represents the true quantum channel 
capacity. We will later discuss this assumption in some 
detail. 

A similar situation occurs for the classical capacity of 
correlated quantum channels, where formulae (j4l6p can 
be replaced with similar expressions involving the Holevo 
quantity instead of the coherent information. Most prior 
work on calculating the capacities of correlated quan- 
tum channels has focussed on the capacity for classi- 
cal information. Numerical and mathematical experi- 
ments involving a small number of channel uses suggest 
that in a variety of interesting cases the classical capac- 
ity of correlated channels can display interesting non- 
analytic behaviour. For instance, the sequence of papers 
(III . Ill, [ll] investigates a certain family of correlated 
channels parameterized by a memory factor jjL G [0, 1] 
which measures the degree of correlations. The results 
of [HI, m, demonstrate that when the correlated 
channel is refreshed after every two uses (i.e. consider 
S2® S2® £2® rather than the full correlated channel 
{£„}), then there is a certain transition value fi — fiQ at 
which the channel capacity displays a definite kink, and 
above this threshold the optimal encoding states sud- 
denly change from product to highly entangled. Similar 
phenomena have subsequently been observed in a variety 
of other cases [13, [l^ . 

Despite these interesting observations, it is still an 
open question whether the sharp kinks in the capacity 
of these models still persist if the full correlated channel 
{£n} is considered as n — s- cx), or whether this behaviour 
is just an artefact of the truncation of the channel at low 
n. The main difficulty in deciding such questions is that 
even under the assumption that equations such as ([6|) 
(or its analogue for classical information - the regularised 
Holevo bound) represent the true quantum capacity of a 
given correlated channel {£n}, in most cases such varia- 
tional expressions are extremely difficult to compute. It 
is however interesting to note that the non-analytic be- 
haviour observed in the channel capacity of correlated 
channels is somewhat reminiscent of the non-analyticity 
of physical observables that define a (quantum) phase- 
transition in strongly interacting (quantum) many-body 
systems, where in contrast true phase transitions usually 
only occur in the n ^ 00 limit. 

Motivated by this heuristic similarity, in a previous pa- 
per [l6t we connected the study of channels with mem- 
ory to the study of many-body physics. One advantage 
of this approach is that allows the construction of a va- 
riety of interesting examples of channels for which equa- 
tion ([6]) can either be understood qualitatively or even 
calculated exactly using the techniques of many-body 
physics. One would otherwise usually expect regularized 
equations such as ^ to either be quite trivial or com- 
pletely intractable. This is perhaps the most important 
consequence of this line of attack - by relating corre- 
lated channels directly to many-body physics, wc obtain 
a good method for displaying models of channels with 



memory that tread the interesting line between 'solvabil- 
ity' and 'non-solvability', in analogy with the many such 
statistical physics models that have been proposed over 
the years. It is quite possible that the insights of uni- 
versality, scaling, and renormalisation that have been so 
successful in many-body theory may provide valuable in- 
tuition for the study of channels with correlated error. 

Another advantage of this approach is its connection to 
physically realistic models of correlated error. One can 
imagine that in many real forms of quantum memory, 
such as optical lattices, any correlated errors might orig- 
inate from interaction with a correlated environment and 
thus be strongly related to models of statistical physics. 
This provides further physical motivation to examine 
the properties of correlated channels with a many-body 
flavour. 

The connection to many body physics also naturally 
leads one to consider channels with structure in 2 or more 
spatial dimensions. In such situations it is no-longer ap- 
propriate to think of correlations as 'memory', as the 
correlations arise not through a single time dimension, 
but perhaps through spatial proximity in more than one 
dimension. In order to define a capacity in such multi- 
dimensional situations one would have to decide how to 
quantify the size of the channel. Natural options could 
include the total number of particles in the system, or 
perhaps the size of one linear dimension. Although we 
will not explicitly discuss multi-dimensional examples in 
this work, such situations might have interesting connec- 
tions to the study of error tolerance in computational 
devices. 

This paper is structured as follows. In order to make 
the paper self-contained, in the sections preceding IVIII 
we present, including all missing detail, the results of 
[l6t . In section IVIII we discuss in detail some sufficient 
conditions that many-body systems must satisfy in order 
to lead to capacity results according to the approach that 
we adopt - the arguments that lead to the development 
of these conditions were sketched in [l^, however here 
we provide the full argument. In sections IVIIII IIXI we 
prove that these conditions hold for finitely correlated 
states and formulate a Fannes type inequality to show 
the same result for harmonic chains. In the remaining 
sections we discuss generalisations of our approach and 
present conclusions. 



II. MANY-BODY CORRELATED CHANNELS. 

In this section we re-cap the approach taken in [l6j 
to construct correlated error models with links to many- 
body physics. The starting point is to suppose, as usual, 
that Alice transmits a sequence of particles to Bob (the 
'system' particles), and that each particle interacts via 
a unitary U with its own environmental particle. So far 
this is exactly the same setting as uncorrelatcd noise. 
However, although each system particle has its own sep- 
arate environment, one can introduce memory effects by 
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system particle with its associated environmental parti- 
cle. In order to provide concrete examples, one must 
hence make a judicious choice of U in order to make an- 
alytical progress. As in we choose U to be of the 
form of a controUed-unitary interaction, where the envi- 
ronmental particles act as controls. In fact, for ease of 
explanation we will also initially restrict the system and 
environment particles to be 2-lcvel spins, and the interac- 
tion U to be a controUed-phase ('CPHASE') gate, which 
in the computational basis for 2-qubits is defined as. 



CPHASE 
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(7) 



FIG. 1: Each particle that Alice sends to Bob interacts with 
a separate environmental particle from a many-body system. 



asserting that the environment particles are in the ther- 
mal/ground state of a many-body Hamiltonian, such that 
the interaction terms lead to correlations in the environ- 
mental state (see figured]). Unlike the uncorrclated case, 
this means that there will be correlations in the noise 
on different system particles. At this point it is impor- 
tant to discuss some of the subtleties involved in the way 
that the 'many-body' system was defined in [l6| . In ba- 
sic approaches to many-body physics, it is usual to con- 
sider a system with a finite number of particles, obtain 
thermal states and ground states, and then take a limit 
as the number of particles is taken to infinity. In more 
mathematical statistical physics literature [24|, however, 
it is usual to consider genuinely infinite systems from the 
start. This involves a number of technical implications, 
including a very different approach to the concept of a 
state, which can no longer be expressed in terms of basic 
density matrices. The two approaches are not necessar- 
ily equivalent and may lead to different results. To avoid 
such technicalities in this work we will follow the former 
approach, and for each number of uses of the channel 
n, we will consider a many-body system of size n. As 
a family of channels for each n this is a mathematically 
well defined object, and it is a reasonable question to ask 
what the resulting channel capacity is. In later sections 
of the paper we will also assume periodic boundary con- 
ditions to enable us to analyze whether equation (|16|) is 
a valid quantum capacity or not. Again, although this 
seems like an unnatural assertion, it is mathematically 
well defined, and in many systems the boundary condi- 
tions are believed to make a vanishingly small difference 
which disappears in the large n limit. 

Of course even with these simplifications not all many- 
body systems can be solved exactly, or even understood 
qualitatively. Moreover, even if the many-body system 
can be well understood, the computation of the limit 
may still be difficult, and may depend strongly upon the 
choice of the unitary U describing the interaction of each 



Later we will discuss how higher level analogues of the 
CPHASE enable similar connections to many-body the- 
ories with constituent particles with a higher number of 
levels. The reason we make these choices for the con- 
trolled unitary interactions is that explicit formulae may 
be derived for the capacity in terms of relatively sim- 
ple entropic expressions which are especially amenable 
to analysis. 

The restriction to controUed-unitary interactions also 
enables us to consider environment particles that are clas- 
sical. For instance, in the case of classical environment 
2-level spins, the 'CPHASE' interaction will be taken to 
mean that the system qubit undergoes a Pauli-Z rota- 
tion when the environment spin is up, otherwise it is left 
alone. It turns out that by considering classical envi- 
ronments it is possible to make more direct connections 
between the channel capacity of our models and concepts 
from statistical physics. 

So let us proceed in trying to understand the capacity 
in cases in which the system particles are all two-level 
systems, with a CPHASE interaction. It is helpful to 
write the resulting channels in a more explicit form. Let 
us consider a quantum environment first. Let |0) denote 
spin-down, and |1) denote spin- up. Let us suppose that 
the environment consists of N spins (eventually we will 
be interested in the limit — > oo) initially in a state: 



P:K.y\xiX2...XN) {yiy2--yN\ 



(8) 



where the sum is taken over all N-bit strings x, y, and 
Xj/yj denote the jth bit of strings x/y respectively. We 
can also describe a classical environment in the same way, 
simply by restricting the input environment state p to be 
diagonal in the computational basis - the CPHASE inter- 
action will in this case leave the environment unchanged, 
and will affect the system qubits as if the controls are 
entirely classical. 

If the environment is in the state ([5]), and the system 
qubits are initially in the state a, then the channel acting 
upon the system qubits is given by: 



it 



(9) 
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where Zi denotes the PauU-Z operator acting upon qubit 
i. Hence, regardless of whether the environment is con- 
sidered quantum or classical, the channel that we have 
described is a probabilistic application of Z-rotations on 
various qubits. Although we will consider qubit i to be 
transmitted earlier in time than any other qubit 7 with 
i < j, there is no need for us to actually impose such 
a time ordering - because all the CPHASE interactions 
commute with each other, such time ordering is irrelevant 

Hi - . . 

We will be interested in computing equation ([5]) for 
such many-body correlated channels. In the next section 
we will show that the channel capacity of this channel is 
given by a simple function of the entropy of the diagonal 
elements in the spin up/down basis of the environmental 
state, i.e. 

-^P^,^'^ogp^.^. (10) 

X 

In the case of a classical environment this is just the 
actual entropy of the spin-chain. This observation is very 
useful, as it allows us to apply all the formalism of many- 
body physics to the problem, also enabling us to that 
intuition to observe a number of interesting effects. In 
the quantum case this function does not correspond to a 
conventional thermodynamic property, however, we will 
discuss examples where it is still amenable to a great deal 
of analysis using many-body methods. 

III. A FORMULA FOR THE COHERENT 
INFORMATION OF OUR MODELS. 

In order to calculate the regularised coherent informa- 
tion ([6]) for our many-body correlated channels, we will 
utilise the close relationship between the quantum chan- 
nel capacity and the entaiiglement measure known as the 
Distillahle Entanglement [25|. This connection utilises a 
well known mapping between quantum operations and 
quantum states. Given any quantum operation £ act- 
ing upon a d-level quantum system, one may form the 
quantum state: 

J{£)^I®£{\+){+\) (11) 

where |-|-) = X]i=i d N) canonical maximally 

entangled state of two d-level systems. The state J{£) 
is sometimes referred to as the Choi-Jamiolkowski state 
of the operation £ It can be shown that the map- 
ping from £ to J{£) is invertible, and hence the state 
J{£) gives a one-to-one representation of a quantum op- 
eration. We will show that for the kinds of correlated 
error channel that we have described above in equation 
([5]), the quantum channel capacity Q{£) of the channel 
equals D{J{£)), the distillable entanglement of the state 
J{£). 

To make the presentation more transparent, we will 
make the argument for the Choi-Jamiolkowski (CJ) state 



of a particular single qubit channel, as it is straightfor- 
ward to generalize the argument to the entire family of 
memory- channels described above. Hence let us consider 
the following single qubit 'dephasing' channel: 

£ : p^pp+{l-p)ZpZ'' (12) 

where p is a probability, and Z is the Pauli Z operator. 
The CJ representation of this channel is: 

Ji£)^I®£{\+){+\) (13) 

where is chosen as in equation pT|) . 

The argument relies upon the fact that the channel 
possesses some useful symmetry. This symmetry 
leads to the property that having one use of the channel is 
both mathematically and physically equivalent to having 
one copy of J{£). Suppose that you have one use of 
you can easily create J{£). However, it turns out 
that with one copy J{£) you can also implement one use 
of £. Hence both the operation and the CJ state are 
physically equivalent resources. The argument works as 
follows. Suppose that you have J{£) and you want to 
implement one action of £ upon an input state p. This 
can be achieved by teleporting p through your copy of 
J{£). This will leave you with the state £{aipa\), with 
the Pauli operator tJi depending upon the outcome of the 
Bell measurement that does the teleportation. However, 
the channel (fT^ commutes with all Pauli rotations. So 
we can "undo" the effect of the Pauli by applying the 
inverse of ct^, which for Paulis is just cr^ itself. Hence we 
have: ai£{aip(jl)(Ti = £{p). Hence by teleporting into 
J(£) and undoing the Pauli at the end we can implement 
one use of the operation. 

This observation allows us to relate the channel capac- 
ity of the channel to the distillable entanglement of the 
CJ state. The proof proceeds in two steps, and follows 
well known ideas taken from articles such as [25j . The 
aim is to show that the 1-way distillable entanglement of 
J{£) is equivalent to Q{£), so that previous results on 
D{J{£)) may be applied. 

(1) PROOF THAT Q{£) < 1-way distillation: (1) Alice 
prepares many perfect EPR pairs and encodes one half 
according to the code that achieves the quantum capac- 
ity Q{£). (2) She teleports the encoded qubits through 
the copies of J{£), telling Bob the outcome so that he 
can undo the effect of the Paulis. (3) This effectively 
transports all encoded qubits to Bob, at the same time 
acting on them with £. (4) Bob does the decoding of the 
optimal code, thereby sharing perfect EPR pairs with Al- 
ice, at the rate determined by Q{£). As this is a specific 
one-way distillation protocol, this means that Q < D. 

(2) PROOF THAT Q{£) > 1-way distillation: (1) Alice 
prepares many perfect EPR pairs and sends one half of 
each pair through many uses of the channel £, (2) She 
and Bob do one way distillation of the resulting pairs 
(this involves only forward classical communication from 
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Alice to Bob). (3) Thereby they share perfect EPR pairs, 
at the rate determined by D{J{£)), the 1-way distillable 
entanglement of . (4) They can use these EPR pairs to 
teleport qubits from Alice to Bob. As this is a specific 
quantum communication protocol, this means that Q > 
D. 

These arguments can easily be extended to apply to 
any channel that is a mixture of Pauli rotations on many 
qubits, hence including the memory channel models that 
we have described above. Hence to calculate the quantum 
channel capacity of our channels we must calculate the 
Distillable entanglement of the channel's CJ state. For- 
tunately, the CJ state of our channel is a so-called maxi- 
mally correlated state, for which the distillable entangle- 
ment is known to be equivalent to the Hashing bound: 



D{J{£)) = S{tr^{J{£)}) - S{J{£)) 



(14) 



where S is the von-Neumann entropy. Note that for such 
channels £ this expression is equivalent to the single copy 
coherent information, which is hence additive for prod- 
uct channels f^". In our case we are interested in the 
regularised value of this quantity for correlated channels. 



Qm}) = hm »11 ^ lim SiJ(S„U)-SiJi£„)) 



n — ^oo fi 71 — *oo 



which can be computed quite easily as: 



(15) 



Q({£4) = 1 - lim ''^^'^^^P-^^ (16) 



n— >oo Tl 



where Diag(penii) the state obtained by eliminating all 
off-diagonal elements of the state of the environment (in 
the computational basis). Hence the computation of the 
quantum channel capacity of our channel {£n} reduces 
to the computation of the regularised diagonal entropy 
in the limit of an infinite spin chain. Although in most 
cases this quantity is unlikely to be computable analyt- 
ically, it is amenable to a great deal of analysis using 
the techniques of many-body theory. It is also interest- 
ing to note the intuitive connection between expression 
(|16p and work on environment assisted capacities - in 
the case of random unitary channels, where the unitaries 
are mutually orthogonal, the diagonal entropy in expres- 
sion (fTB)) has a natural interpretation as the amount of 
classical information that needs to be recovered from the 
environment in order to correct the errors pTi . [l8| . 

Although the above analysis has been conducted for 
2-level particles, it can be extended to situations involv- 
ing d-level systems. In the d-level case one can replace 
CPHASE with a controlled shift operation of the form: 



E \k){k\^Zik) 



(17) 



i=l..a 



where the Z{k) = J^j exp(i27rfcj/d)|j) (j| are the versions 
of the qubit phase gate generalized to d-level systems. 



and the first part of the tensor product acts on the envi- 
ronment. With this interaction all the previous analysis 
goes through, and the d-level version of eq. dTT 



Q{{£n}) = log(d) - lim 



5(Diag(pe™)) 



(18) 



gives the regularized coherent information, where 
Diag(p) refers to the diagonal elements in the d-level com- 
putational basis. It is important to consider the gener- 
alization to d-level systems because the thermodynamic 
properties of many-body systems do not always extend 
straightforwardly to systems with a higher number of lev- 
els. For instance, one possible generalization of the Ising 
model to c?-lcvel systems is the Potts model, which leads 
to some very interesting and non-trivial mathematical 
structure [l9| , and in the quantum Heisenberg model the 
presence of a ground state gap depends on where the 
spins in the chain are integral or half-integral [36l |. 

The simplicity of equation p6p enables one to immedi- 
ately write down many noise models for which the regu- 
larized coherent information can both be calculated, and 
also represents the quantum channel capacity of the cor- 
related channel. In particular, let us suppose that the 
environment consists of classical systems described by a 
classical Markov Chain (those readers not familiar with 
the Markov chain terminology required here please see 
chapter 5 of [1^ for a very readable introduction) . If the 
state at each 'site' s in the environment represents the 
instantaneous state of a Markov chain at time s, then 
the regularised entropy in equation (|18p is given by the 
entropy rate of the Markov chain |23 | . p rovided that the 
Markov process is both irreducible [5l| and possesses a 
unique stationary (equilibrium) state. Let the transition 
matrix of M of the Markov chain be defined such that 
Pi{s -I- 1) = J2i ^ijP{^)jj Vi be the ith element of the 
stationary probability distribution, and let Hi be the en- 
tropy of column i in the Markov chain transition matrix. 
With these conventions the entropy rate is given by: 



lim 



S{Diag{pe„v)) 



E 

= l...,d 



ViHj 



(19) 



In these cases the correlated channels fit quite neatly into 
the class of models proposed in 0] , and moreover these 
channels will be forgetful 0] . As proven in Q , for forget- 
ful channels the regularized coherent information is equal 
to the quantum capacity (see [1^ for an independent cod- 
ing argument which also works for Markov chain chan- 
nels implementing generalised Pauli rotations). Hence for 
these models equation ([T5|) represents the true quantum 
channel capacity, and so we may write explicitly: 



Q (Markov) = log(d) - E 



(20) 



i=i, 



When unique, the stationary distribution of a Markov 
chain is given by the unique maximal right eigenvector 
(of eigenvalue 1) of the transition matrix. Related results 
have been obtained independently in the articles [13, [2l[ . 
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IV. ENVIRONMENT THAT IS A CLASSICAL 
SYSTEM 

In the case of a classical environment, the second term 
of equation (|16[) is precisely the entropy of the environ- 
ment, and so it can easily be computed in terms of the 
partition function. 

The partition function of the classical system is defined 
as: 

Z = ^exp(-/3i?,) (21) 

i 

where the Ei are the energies of the various possible con- 
figurations, and (3 = l/(fcBT), with T the temperature 
and ks Boltzmann's constant. The entropy (in nats) of 
the system is given by the following expression: 

5(Diag(pe„.)) = (^1 - hi Z. (22) 

This means that in the case of a classical environment 
our channel capacity becomes 

Q({£„}) = l-log2(e)fl-/?J-) hm llnZ (23) 

\ op ) n— ►oo n 

where the log2(e) converts us back from nats to bits. 
This expression means that we can use all the machin- 
ery from classical statistical mechanics to compute the 
channel capacity. 

In particular, any spin-chain models from classical 
physics that can be solved exactly will lead to channels 
with memory that can be 'solved exactly' (provided that 
one can show that the regularized coherent information 
is indeed the capacity, a problem that we shall discuss in 
later sections). The most famous example of an 'exactly 
solvable' classical spin-chain model is the Ising model. 
We will discuss the classical Ising model in detail in the 
next section, as it will also be relevant to a certain class 
of quantum spin-chains. 

However, there are also many classical spin-chain mod- 
els that cannot always be solved exactly, but which can be 
connected to a wide variety of physically relevant models 
with interesting behaviour. As just one example, con- 
sider modifying the Ising spin-chain model to allow ex- 
ponentially decaying interactions between non-adjacent 
spins. The resulting model can be related to a quan- 
tum double-well swtem, and is also known to exhibit a 
phase transition [27[ . This means that the corresponding 
correlated channels will also exhibit similar behaviour, 
provided of course that the limit eq. ^ truly represents 
the quantum channel capacity for the models. 

In this paper we will not give detailed discussion of any 
further models involving a classical environment (other 
than the classical ID Ising chain, which we will discuss 
in the next section). As our expression (|16p is simply 
the entropy of the classical environment, the interested 
reader may simply refer to the many interesting classical 



models (both solvable and almost solvable) that are well 
documented in the literature. Of course, to make the 
analysis rigorous one would need to show that expression 
([5]) is the formula for the quantum capacity in these cases. 
However, we conjecture that for most sensible models this 
should be true. In the final section of the paper we will 
present an analysis that demonstrates this for a family 
of ID models. 



V. QUANTUM ENVIRONMENTS 

Unfortunately expression (|16p does not correspond to 
a standard thermodynamic function of the environment 
state when the environment is modelled as a quantum 
system. It represents the entropy of the state that re- 
sults when the environment is decohered by a dephas- 
ing operation on every qubit. Although this quantity is 
not typically considered by condensed matter physicists, 
there is some hope that it will be amenable to analysis 
using the techniques of many-body theory. 

In this paper we will make a small step towards justify- 
ing this hope by analytically considering a class of quan- 
tum environments inspired by recent work on so-called 
Finitely Correlated or Matrix Product States [2^ . 

We will leave attempts to analytically study more com- 
plicated models to another occasion, although in figure 
(PI we present some numerical evidence that the quan- 
tum ID Ising model displays a sharp change in capacity 
at the transition point. 



VI. QUANTUM CAPACITY FOR FINITELY 
CORRELATED ENVIRONMENTS DESCRIBED 
BY RANK-1 MATRICES 

Finitely Correlated or Matrix Product states are a spe- 
cial class of efficiently describable quantum states that 
have provided many useful insights into the nature of 
complex quantum systems [28| . In a recent paper (33j 
it has been demonstrated that a variety of interesting 
Hamiltonians can be constructed with exact matrix prod- 
uct ground states, such that the Hamiltonians in question 
undergo non-standard forms of quantum 'phase transi- 
tion'. 

As matrix product states are relatively simple to de- 
scribe, one might hope that for such ground states 
the computation of equation (|16p may be particularly 
tractable. In this section we will see that for matrix 
product states involving rank-1 matrices the analysis is 
particularly simple, and may be reduced to the solution 
of a classical ID Ising model. 

Let us consider a ID matrix product state, where each 
particle is a 2-level quantum system, |0), |1). Let us as- 
sume that the matrices associated to each level are inde- 
pendent of the site label, and are given by Qo for level 
|0) and Qi for level |1). Hence the total unnormalised 
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FIG. 2: Numerics for the quantum Ising model suggest that 
there may be transition behaviour in the capacity at the phase 
transition point of the ID quantum Ising model. In this figure 
the central point of the horizontal axis is the transition point 
of the quantum Ising model, and the curves become increas- 
ingly steep as the number of spins is increased from 6 to 18. 
As the quantum Ising model can be solved exactly in ID, it 
is quite possible that an analytical solution may be found for 
the channel capacity. 



B arc both square rank-1 matrices. Some of the example 
Hamiltonians discussed in [ssl ] have ground states with 
this property, and in fact some special cases of the noise 
models presented in [ll|, [l^, [U, [3 can also be expressed 
in the form of matrix product environments with rank-1 
matrices (although in general those models require more 
than two matrices as they require environmental spins 
with more than 2 levels). Wc will show that in such sit- 
uations the diagonal entropy in the computational basis 
is equivalent to the entropy of a related classical Ising 
chain. 

The first thing to note is that rank-1 matrices are al- 
most idempotent. In fact, if A, B are both rank-1 matri- 
ces, then we have that: 



A" 



'A 



B'' 



B 



(28) 



where a is the only non-zero eigenvalue of A, and b is 
the only non-zero eigenvalue of B. Note that because of 
the form of A and B as the tensor product of a matrix 
and its complex conjugate, these eigenvalues a, b must be 
non-ncgativc. We can define the normalised matrices: 



a 



(29) 



These normalised matrices are idempotent. To see how 
this can help, consider a particular string, say, 



state can be written as: 



ty{QiQjQk---)Vjk. 



(24) 



.6(0,1} 



From the form of expression (jl6p we see that we are only 
interested in the weights of the diagonal elements in the 
computational basis, or equivalently the state that results 
from dcphasing each qubit. It is easy to see that this 
unnormalised state will be given by: 

p= (25) 

i.],..e{o.i} 

In this expression if wc relabel the matrices A — Qq (g) Qq 
and B ~ Qi (E) Ql then probability of getting various 
outcomes when measuring the environment in the com- 
putational basis will be given by traces of all possible 
products of the As and Bs. For instance, the probability 
of getting 01100... when measuring the environment in 
the computational basis will be given by: 



1 



Poiioo... — 



CiN) 



tr{ABBAA...} 



(26) 



where TV is the number of qubits in the environment, and 
C{N) is a normalisation factor given by: 



C{N) =ti{{A + B)^} 



(27) 



C{N) can be computed by diagonalisation. In the rest 
of this section wc will be interested in cases where A and 



Poiiiooo 



CiN) 



tr{{ABBBAAA)} 



if wc substitute A and B into this expression, and use 
the idempotency, then the strings of consecutive As and 
Bs will collapse to just one A ot B, with total factors of 
and b^ inserted outside the trace: 



Poiiiooo 



C{N) 



■{{aba)} 

{(-)} 



(30) 



It is easy to see that this form is quite general - the 
probability of getting a particular string will collapse to 
a simple expression. If there are I occurrences of A and 
n — I occurrences of B in the string, and K counts the 
number of boundaries between blocks of As and blocks 
od Bs, then the probability of the string becomes: 



1 



CiN) 



(a'6^-')tr|(iB)''| 



Noting that AB will also be a rank-1 matrix, let us use 
the letter c to refer to its only non-zero eigenvalue. Hence 
the probability becomes: 



1 



luN-l„K 



CiN) 



a'b 



(31) 



This expression tells us quite a lot - firstly for any given 
channel described by rank-1 MPS states, the only param- 
eters that matter are a, b, c. So we needn't work with the 
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actual matrices defining our state, we only need to work 
with matrices of our choosing that have the same param- 
eters a, b and c. In the following we will assert that c is 
non-negative - this is guaranteed because of the following 
argument: it holds that c = tr{Ai3}, because AB is rank- 
1, but because AB = QoQi ® QoQi/(a&), where a, b arc 
non-negative, this means that c must be non-negative. 
So let us just go ahead and pick the following matrices: 



\B = 



(32) 



These matrices clearly have non-zero eigenvalues a, b re- 
spectively. So what about the eigenvalue of ABl For the 
above choice of matrices we find that: 



AB 



c 





(33) 



Hence we find that the matrices that we have chosen 
have the correct values of a, 6, c, as required. Now we 
notice that the matrices that we have chosen in equation 
(|32p are very similar to the matrices that would define 
a classical Ising chain. In fact, if we make the following 
change of variables from a, b, c to J, D, M: 



a^cxp{l3{J + M)), 
6 = exp(/3(J-M)), 
c = exp(-/3(4J + 2D)). 

The inverse transformations are: 

/3J= (ln(a)+ln(6))/2, 
f3M = (ln(a) -ln(6))/2, 
(3D = -(ln(a) + ln(6)) - (1/2) ln(c). 



(34) 



(35) 



It turns out that the parameters J, D will represent cou- 
pling constants and M will represent a magnetic field. To 
see this let us insert the new parameters into the choice 
of ^, B in equation ((32)) . Then we get that the matrices 
(l32ll can be written: 



A 



B 



exp(/3(J + M)) exp(-/3(J + D)) 




exp(-/3( J + D)) exp{l3{J - M)) 



(36) 



The matrices in such a rank-1 MPS are essentially the 
top row and bottom row of a transfer matrix. Comparing 
these matrices to the classical Ising transfer matrix we see 
that the following Hamiltonian (where for convenience 
we now follow the usual physics convention that Si G 

H = - JsjSi+i - Msi + D{1 - SiSi+i) 

i 

= -^(J-i?)s.s,+i-Ms, + (37) 



The D is just a constant shift in spectrum, so we can 
simply consider the Ising chain with Hamiltonian: 



H = - 



(J - D)s,s,+i - Ms^ 



(38) 



The partition function for such a chain of N particles 
depends upon the transfer matrix for this (rescaled!) 
Hamiltonian: 

/ cxp(/?( J -D + M)) cxp(-/3(J - D)) \ 

cxp{-p{J-D)) cxp{f3{J - D - M)) J ^-^^f 

Now from the partition function we can calculate the 
entropy, and hence the capacity of our channel. The 
formula turns out to be: 



l-log2(e)(l-/?|^)lnAi 



(40) 



where Ai is the maximal eigenvalue of the transfer matrix 
([55)1 . Using these equations and equation ()35p one can 
perform the (tedious) manipulation required to derive a 
formula for the regularized coherent information in terms 
of the coefficients a,b,c. Although we do not present the 
formula that is obtained, figure ([3]) shows the result for 
the model Hamiltonian presented in [33l |: 



H 



^2(.9^-l)4'Vr^^-(l+g)V^ 

i 

+ (5-l)^ari)a«ari). (41) 



for which the ground state is known to be a matrix prod- 
uct state of the form: 



Qo 





1 1 



Qi 



1 9 




This model system has a non-standard 'phase transition' 
at 5 = 0, at which some correlation functions arc continu- 
ous but non-diffcrentiable, while the ground state energy 
is actually analytic [3^. As discussed in the caption of 
figure ([3]), this behaviour is mirrored in the channel ca- 
pacity. 



VII. CONDITIONS UNDER WHICH THE 
REGULARISED COHERENT INFORMATION 
REPRESENTS THE TRUE CAPACITY. 

In this section we will explore under what conditions 
our assumption that the regularized coherent information 
of equation (O: 



Q{{£n}) := Hm 



(42) 



correctly represents the true quantum capacity of our 
correlated channels, assuming of course that this limit 
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Diverging gradient 



oooooooooooooocoooooooooooooo 



FIG. 4: The Live blocks of length I are separated by Spacer 
blocks of length s. By discarding the spacer particles the 
channel effectively becomes a product channel on the Live 
blocks. 



g-0 



A. A qualitative description of the argument. 



FIG. 3: This schematic figure shows the channel capacity 
when the environment is the ground state of the Hamiltonian 
given in equation (|41|l . The symmetry in this plot is to be 
expected as the channel is invariant under the replacement 
g — > —g. However, near the 'phase transition' point 3 = 0, 
the gradient diverges. 



exists. In the course of the discussion we will also need 
to consider under what conditions the regularized Holevo 
bound: 

C(£„) := lim ^ (43) 

n — ^oo fi 

represents the capacity of the channel for classical infor- 
mation. The Holcvo bound x(f ) for a quantum channel 
£ is defined as [l|: 

Xi£)= sup (44) 
{p., p.} i i 

where the supremum is taken over all probabilistic en- 
sembles of states {pi,Pi\^ and S as usual represents the 
von Neumann entropy. As pointed out in @, Q , showing 
that equations are upper bounds to the quan- 

tum/classical capacity of a correlated channel is straight- 
forward - one can use exactly the same arguments used 
in the memoryless case [1, O, 113, HE]- Showing that 
equations ((i^ / also give lower bounds to the rele- 
vant capacities is not as simple, and may not be true for 
some many-body environments. 

However, it turns out that if the correlations in the 
many-body system fall off sufficiently strongly, then the 
channel will be reasonably well behaved and equation 
is true. In this section we will make this statement 
quantitative. We will closely follow the approach taken 
in in the analysis of so-called forgetful channels. Some 
of the subtleties involved in the analysis are explained in 
more detail in Section VI of that paper. The conditions 
that we obtain are independent of the unitary which gov- 
erns the interaction between each system particle and its 
corresponding environment, and so are applicable more 
widely than the dephasing interaction considered here. 



In this subsection we present an intuitive sketch of the 
argument that we will follow. Imagine that the corre- 
lated channel is partitioned into large blocks that we shall 
call live qubits, separated by small blocks that we shall 
call spacer qubits. The idea is to throw away the spacer 
qubits, inserting into them only some standard state, and 
to only use the live qubits to encode information (see fig- 
ure |4|). If we are to follow this procedure, then we will 
not be interested in the full channel, but only in its effect 
upon the live qubits. Let us use the phrase live channel 
to describe the resulting channel - i.e. the reduced chan- 
nel that acts on the live qubits only. If the correlations in 
the many-body system decay sufficiently strongly, then 
by throwing away just a few spacer qubits we will find 
that the live channel closely approximates (in a sense to 
be discussed later) a memoryless channel. Let us call this 
memoryless channel the product channel. One can imag- 
ine trying to use the codes that achieve the capacity of 
the product channel, without any further modifications, 
as codes for the live channel. It turns out that under the 
'right conditions' these codes are not only good codes 
for the live channel, but their achievable rates approach 
equation ((42|) . The goal of the next subsection will be to 
explore exactly what these 'right conditions' are. 

The quantitative arguments follow the method used in 
, where three steps are required to show that equation 
(|42)) is an achievable rate: 

[A ] First we must show that product codes for the 
transmission of classical information are good codes 
for the Live channel. 

[B ] Then we must show that these good codes allow 
the regularized Holevo quantity to be an achievable 
rate. This is done by showing that the product 
channel Holevo quantity (which can be achieved by 
product codes) essentially converges to the regular- 
ized Holevo quantity for the whole channel. 

[C ] Then we must argue that these arguments for the 
transmission of classical information can be ' coher- 
entified (in the manner of [lOj ) to a good quantum 
code attaining equation (|42|) . 

In the next subsection we go through this process in detail 
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to derive sufficient conditions to demonstrate the validity 
of equation (|^^ for our many-body channels. 

B. Derivation of the conditions. 

In this subsection we will go through steps [A],[B],[C] 
in turn. 



1. Step [A] 

We will assume that the many-body systems in ques- 
tion satisfy periodic boundary conditions and are trans- 
lationally invariant (this means that the corresponding 
correlated channel {£n} does not quite fit into the def- 
inition of causality proposed by [6|, however, it allows 
us to avoid the technicalities required to analyze a truly, 
genuinely, infinite many-body system). Let us consider 
a specific length of chain N, split into v = N/{1 + s) 
sections, each consisting of one live block of length I and 
one spacer block of length s := SI << I. In the following 
the sizes N, I will generally be taken to be large enough 
that the statements we use hold. The Live channel will 
be defined by: 

Siiy^ : A tTenv{U{pL,L2....L^ ® A)U''} (45) 

where A represents the state that Alice inputs to the 
live channel, U represents the interaction between the 
environment and A, the labels Li, L2, Ly represent the 
live blocks from sections l,..,v, and the trace is taken 
over the environment. Due to translational invariance 
the reduced state of the environment corresponding to 
each given live block will be same, and so let us denote 
this state by p^v- With this notation, the product channel 
will be defined by: 

^product 

: A^trenv{C/((p5v)^"®A)t/t}. (46) 

Note that both the Live and Product channels have a de- 
pendence upon both the live block length I and the total 
number of spins N. Let us first consider using the Prod- 
uct and Live channels to send classical information. By 
definition, if a given rate R is achievable for the Product 
channel, then for every error tolerance e > there is an 
integer N^^ such that for n > channel uses there exist 
a set of V = [2"'^J codeword nZ-qubit states {pi, ■■■,Pu} 
and a corresponding decoding measurement {Mi, AI^} 
such that: 

{p^)^'U} >l-e Vie (47) 

If the same codebook and decoding measurements are 
used without alteration for the Live channel, then the 
error would be: 

^^W^livei^Pi) ^product 
oduct 

{pi)Mi}. (48) 



As the addition of Alice's state A, the unitary interaction 
U , and the POVM element Mj can all be viewed as one 
new POVM element acting only on the environment, the 
left term in this formula can be bounded by [l| 

|tr{[fii.„e(pi) — Sproduct{Pi)]Mi\\ 

<^IIpl.l.....l„-(p5v)^1Ii 

where \ \X\\i :=tr{\/XtX} is the trace norm. Hence the 
error (|48p in using the product code for the Live channel 
can be bounded by: 

iY{£uve{p^)^'U] > 1 - e - i||pL,L,....L„ - {PnT"\\i 

Assume that the rightmost term in this equation is 
bounded by: 

||PL,L.....L„ - {p^Nni < Cvl^cM-Fs) (49) 

for positive constants C, E, F. This assertion will be 
demonstrated for some special cases in section IVIIII 
Then this would mean that the error becomes bounded 
as 

ti{£Hye{p^)M,} > 1 - e - Cvl^cxp{-Fs). (50) 

The e part of this error depends upon the number of 
blocks V. One potential problem that we immediately 
face is that to decrease e we need to increase d, however, 
increasing v inevitably increases the last error term in the 
equation. It is hence not a priori clear that both error 
terms can be made to decrease simultaneously. However, 
it can be shown 0, 0] that if we pick v = l^ , s = 6l,S > 
then both error components can be made to vanish as I 
increases, while still operating at the achievable rates of 
the product channels (in fact, the number of sections v 
could be given any polynomial or subexponential depen- 
dance on I provided that asymptotically v{l) > l^). 

So we see that provided condition pS]) can be demon- 
strated for the many-body systems that we consider, then 
the Product channel works well for the Live channel, as 
long as a large enough live block size is used (however 
small the fraction of spacer qubits i5). Hence equation 
(|49|) is the first of our sufficient conditions. In section 
IVIIII we demonstrate that condition ([49|) (which is iden- 
tical to equation ([55]) later in the paper) holds for some 
interesting classes of many-body system, including ma- 
trix product states. 

2. Step [B] 

Now that we know that the product code is also suit- 
able for the live channel, it is necessary to check that the 
regularized Holevo bound (i.e. the regularized Holevo 
bound for the full channel without throwing spins away) 
is actually an achievable rate for the live/spacer blocking 
code that has been used. In order to make this analysis 
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• • • 



• • • • • 

(a) Total chain 
length Af>> / 



• • • • 



(b) Total chain 
length / + J -/ 



FIG. 5: To show that the Product channel (which is just a 
product of the reduced channel on a single Live block) Holevo 
capacity is essentially the regularized capacity, we need to 
show that the reduced channel on a single Live block is essen- 
tially independent of the total length of the chain. Hence we 
need to show that the reduced state of I contiguous environ- 
ment spins is approximately the same regardless of whether 
the chain is (a) much longer than I, or (b) slightly longer than 
I. 



it will be convenient to define a little more notation. For 
a total chain of length n as before let £„ denote the noisy 
channel. For a contiguous subset of j < n of the spins 
that Alice sends, let £^ denote the effect of the channel 
only upon those spins. Due to translational invariance 
the location of the spins is irrelevant, as long as they 
form a contiguous block. 

A given product channel with live block length I and 
a total number of spins N = v{l + s) = + d) has a 
Holevo quantity given by: 



(51) 



where the • merely acts as a place holder for the inputs 
to the channel. Our goal is to show that for large enough 
I this expression is close to the regularized Holevo bound 
equation (j43ll (see figure [5]). It is not too difficult to de- 
rive conditions under which this will be the case. Suppose 
that we have a spin chain of total length / + A(Z) where 
A(^) << I. In fact we will only be considering functions 

A(Z) > such that lim/_too = 0. The subadditiv- 
ity and the Araki-Lieb inequalities for the entropy ([l|, 
section 11.3.4), i.e., 



S{A) + S{B) > S{AB) > \S{A) - S{B)\ 



(52) 



can be inserted straightforwardly into the Holevo bound 
to show that: 

Xi£l+A) > x{£i+a) - 2A log(d) (53) 

where d is the dimension of each communication spin 
(see also Q)- This equation follows from the fact that 
the Holevo bound is the difference of two entropic terms, 
each of which can change by at most A log(d) under the 
tracing out of A d-level particles. Dividing through by I 
now gives: 



x{£i 



l+A 



) ^ 1 + Ax(£i+a) 
- I l + A 



2jlog{d) 



(54) 



This equation tells us that the Holevo quantity for a sub- 
set of I spins is very close to the Holevo quantity for a full 
chain of Z + A spins, as long as A is small. Our goal now is 
to show that if the subset of I spins is drawn from a much 
longer chain of length N = 1^{1 + S), then the subset still 
has essentially the same value for the Holevo quantity, 
and so the regularized Holevo quantity represents the ca- 
pacity of the product channel. Intuition suggests that if 
the correlations decay fast enough, then it should be the 
case that for N = l^{l + 6) we should have approximately 
Sj^^ ^ ^ given region shouldn't 'feel' how long 

the chain is. Now suppose that we define 

P = P(?, A) := \\p\^^ - p^lli = \\p\^^ - p\e^,+s)\\l■ 

(55) 

Then for a given input state uj on the live block in ques- 
tion the output states will differ by at most: 



< 



||trenv{t/[w® (pI+a -p|6(i+5))]C/t}||i 

|C/[^®U+a-p5«(i+5))]C^^IIi <^a,A). (56) 



Hence Fannes inequality [40] (of which a version suitable 
for our purposes is \S{X) - S{Y)\ <\\X- Y\\i log(d) + 
log(e)/e) can be used to bound the difference in the two 
Holevo functions x{£i+a)tX{£n) follows: 



x{£\ 



I 



> 



Plog(d') 



log(e) 



Putting this equation together with equation ([54|l gives: 
^(4(1+5))^ 1 + Ax{Si+a) 



I l + A 
A 



-2-log(d)-2f y 1 fPlog(d') 



log(e) 



and taking the limit of large I gives 

X(^/2(l + 5)) 



lim 

/ — *oo / 



> Xoo - lim 2(Plog(d)) 

I — ^oo 



So as long as long as we can pick a function A{1) such 
that limj^oo A{l)/l = 0, and such that the norm distance 
P{l,A{l)) vanishes with increasing I then we know that 
the regularized Holevo quantity is the correct capacity. 



3. Step [C] 

Now that we have understood the conditions under 
which the regularized Holevo bound represents the ca- 
pacity for the transmission of classical information, we 
need to try to undertake the same analysis for quantum 
information. As was also exploited in [6|, the way that 
Devetak's work [l3| proves that the regularized coher- 
ent information equals the quantum channel capacity of 
memoryless channels is to first prove a capacity formula 
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for the transmission of private (secret) classical informa- 
tion, and then to make the private coding scheme coher- 
ent. This 'coherentifi-cation' procedure applies directly to 
correlated channels, and so to argue that the regularized 
coherent information (|42|) is also achievable for channels 
with correlated noise, it is sufficient to show that the pri- 
vate information codes that work for the Product channel 
are also suitable for the Live channel. So now suppose 
that a malicious eavesdropper is in charge of the environ- 
ment of our correlated channel. We need to prove that 
the information that she can access is still limited when 
Product private codes are used for the Live channel. We 
can see that the output that Eve obtains is given by: 

£f ™ : A ^ tTsys{U{pL,L,....L^ ® A)^} (57) 

where the tildes mean that environment state p must be 
extended to give a closed system (i.e. p is a pure state), 
the entire environment of which is assumed to be totally 
under Eve's control. In the case of the Product channel 
the privacy condition means that for all e > there is 
a vq such that for all v > vq there exists some standard 
state 9 such that: 

\\tY,ys{U{{p'j^)'^^ ® A)U^} ^e\\,<e (58) 

for all inputs A from the privacy code (readers famil- 
iar with @, [l3| will note that in those works an extra 
randomisation index was included as a label in the code 
states - however, in our context this is unimportant and 
so we omit it for case of notation). Applying the same 
code to the Live channel gives the estimates: 

\\trsys{UipL,L,....L^®A)U^~e\\i 

< e+\\tr^y.{U{[pL,L,....L^ - (p5v)n ® A)um\, 

< ^+||PL,L.....L„-(/5^)^1|i (59) 

The last term in this equation represents the norm dif- 
ference between the purifications of two different possible 
environmental states. We are free to pick the purifica- 
tions that give the greatest overlap between the two envi- 
ronment states. Although this may seem like a contradic- 
tory step, as we should allow Eve to have control over the 
environment, it is in fact valid because the product code 
is by assertion private for all possible extensions of the 
product channel. The coherentification procedure leads 
to the distribution of maximally entangled states which 
arc automatically uncorrclated from the environment, 
whatever purification Eve decided to use. The last line 
from the previous equation hence becomes (using the fact 
that for two pure states the overlap and the trace distance 
are related by 1 1 ( | (^| - | V) (V I ) 1 1 1 = 2 ^l-M(bW, see 
Nielsen & Chuang [H, p. 415 eq. (9.99), noting that the 
factor of 2 comes in from a different convention for the 
trace norm): 

< e + 2^1- F^{pl,l,....l^,{p'n)^'') (60) 

where F is the Uhlmann fidelity [Ij. Hence, using 
the well known relationship between the Uhlmann fi- 
delity and the trace norm of two states (1 — F{x,y) < 



2\\x - y\\i < ^1 - F{x,y)^, Nielsen & Chuang [l|, 
page 416, from which one can obtain a/1 — F(x, yY < 
^2(1 - F[x,y)) < 2^y\\x - y\\i), we find that: 

\\trsys{U{pL,L,....L^®A)U^-e\\i 

< e + V||pL,L,....L„-(p5v)^1|l (61) 

Putting the norm bound (j49p (which we have not yet 
justified) into this equation gives: 

\\trsys{UipL.L,....L^ ® A)U^ - 

< e + i^Cvl^ expi^Fs) (62) 

which is small enough for the assignment v = l^ ,s = 51, 
as long as I is large enough. 

4. Summary of sufficient conditions. 

All of this analysis means that in order to argue that 
the regularized coherent information and the regularized 
Holevo bound arc the true quantum or classical capaci- 
ties, the following two conditions taken together are suf- 
ficient: 

(1) To show that the product codes are also good for 
the partitioned memory channel, 

\\PL,L,....L,, - (Pw)^''l|i < C/^Z^exp(-Fs) (63) 

for some positive constants C,E,F, where = + 
5),s^Sl. 

(2) To show that the regularized coherent information 
is the appropriate rate for the these codes we need to 
show that 

hm ||p1+a(z) = lim ^'(^, A(/)) = (64) 

for some function A(^) such that lim;^oo ^(0/^ — 0- 
fact if equation (|63p holds, in this condition we could re- 
place pj6(-]^_|_5-) with where the number of sections 
V is any function of I with a sub-exponential dependance 
(e.g. a polynomial) that is asymptotically larger than l^. 

To demonstrate that these conditions hold for the most 
general types of many-body system is a non-trivial task. 
However, in a number of interesting cases it is possible 
to prove that these conditions hold. In the remaining 
sections we demonstrate that these conditions hold for 
finitely correlated/matrix product states, as well as for a 
class of ID bosonic system whose ground states may be 
determined exactly. 

VIII. PROOF OF PROPERTY EQ. ^ FOR 
VARIOUS STATES 

In this section we provide proofs for the validity of 
eq. (|63p for a variety of quantum states. These include 
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matrix-product states for which we have discussed ex- 
pUcit memory channels in this paper. In fact the proofs 
that we present for matrix product states are essentially 
contained in previous works such as [1^ . We also demon- 
strate analogous results for the ground state of quasi-free 
bosonic systems as such systems may provide interesting 
examples for future work. In addition to the results we 
present here and in the next section, M. Hastings has 
demonstrated that conditions (|63I64[) hold for certain in- 
teresting classes of fermionic system (ioj . 

Matrix product or finitely correlated states ~ The proof 
that we present here is essentially one part of the proof of 
proposition 3.1 in [28| . Our presentation of the argument 
benefits from the argumentspresented in Appendix A of 
[29| and the review article [30| . 

An important tool in the argument is the use of the 
Jordan canonical form (3lj |. As some readers may be 
unfamiliar with this technique, we briefly review it here. 
If a square matrix M has complex eigenvalues {Aq,}, then 
it can be shown that a basis may be found in which the 
operator can be expressed as the following direct sum: 



M = 0(A„I„+AA„) 



(65) 
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where each I„ is an Identity sub-block with an appro- 
priate dimension, and each Afa is a nil-potent matrix, 
meaning that for each Af^ there is some positive integer 
k such that A/"^' = 0. Moreover, each nilpotent matrix A/'q 
itself may be written as a block-diagonal matrix, where 
each sub-block is either a zero matrix, or is all zero ex- 
cept possibly for Is that may be positioned on the super- 
diagonal. In other words, each sub-block of a given Ma 
is cither zero or is of the form: 



(66) 



The decomposition ([55)1 is the Jordan canonical form of 
M. In our case the matrix M will be constructed from 
a completely positive map that can be associated to the 
matrix product states that we consider. One consequence 
of this, for reasons that we discuss later, is that we will 
ultimately only be interested in operators M whose eigen- 
values satisfy 1 = Ai = |Ai| > IA2I > IA3I > .... For a 
related reason we will also only be interested matrices 
M for which there is a unique eigenvector corresponding 
to Ai , and also for which the sequence of integer powers 
M^, r = 1, 00 is bounded. 

For matrices obeying these extra conditions we may 
exploit the Jordan normal form in the following way. Pick 
the smallest integer k such that Af^'^^ = for all Af^- 
Then can be written as follows: 



e 



E 

m— 0,. . -fc 



(67) 



If r is large, then all blocks corresponding to a ^ 1 will 
become small because of the X^~™ term, and so the only 
sizeable contribution to A/'' will come from the block 
corresponding to a = 1, i.e. the sub-block: 



E 

m=0,.., 



Mr 



(68) 



Now we have asserted that the sequence of operators Af 
is bounded. However, it is not too difficult to show that 
for r = I..00 the sequence of operators ()68p becomes 
unbounded if A/i is non-zero. This means that if the 
sequence of operators Af is bounded, we are forced to 
conclude that A/i = 0, and hence as M has a unique 
maximal eigenvector, this means that Ii is an identity 
matrix of dimension 1x1, i.e. Ii = 1. 

Putting all this together means that a square matrix 
A/ with a unique maximal eigenvalue 1, such that the 
sequence Af is bounded, may be decomposed as: 



Af = l©0(A„I„+AA„) 
This means that A/'' can be written in the form: 



(69) 



Af 



1©A? 



e E 

Q^l 7n—0,...k 



Af!! 



(70) 



For our purposes it will be convenient to pull out a factor 
r*^ from the term in square brackets: 



ler^A? 



e E 

Q^l m— 0,. . , 



(') 

\ni/ 



Ar 



Af" 



(71) 



This has the advantage of making the operator in square 
brackets bounded even as r 00. This form for Af" 
will be extremely useful to us. We will apply it to a 
completely positive map that can be associated to any 
matrix product state. Using this, we will show the decay 
of correlations required. 

The relationship between matrix product states and 
CP maps is described in detail in articles such as [H, [lO] ■ 
Any matrix product state can be generated by repeat- 
edly acting on a fictitious ancilla particle using an ap- 
propriately constructed CP map. Suppose that we have 
a matrix product state of N particles j £ {1,..,7V}, 
each associated with a Hilbert space Hj. Consider also 
a fictitious 'generator' ancilla system on a finite dimen- 
sional space Hgen- It can be shown that the state of the 

particles in the matrix product state can be defined 
as the state that results from an appropriate CP map 
T : B{Hgen) B{Hgen) ^ 13 {Hj) which generates each 
particle j E {1, .., N} in sequence. The generating ancilla 
is then traced out to give the matrix product state of the 

particles. Related to the map T is the completely pos- 
itive map Q, which is the restriction of the map T to the 
generator ancilla as both input and output. The map Q 
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essentially represents the transfer matrix of the MPS - 
for a review of how to construct T for matrix product 
states, sec the article [soj . 

The starting state of the fictitious generator ancilla is 
usually taken as a fixed point of Q, in order that the 
MPS be translationally invariant. Away from a phase 
transition point, the CP map Q has a unique fixed point 
of eigenvalue 1, with all other eigenvalues of absolute 
value strictly less than 1. Let this fixed point of Q be the 
state a. Furthermore, as Q is a CP map, it is clear that 
the sequence of maps QJ' is bounded. Hence as Q acts 
as a finite dimensional linear operator taking the ancilla 
space to itself, we can also think of it as a square matrix 
and apply equation (|7T|) to represent powers QJ" of the 
map. Let us use this form to compute the action of Q'' 
on an input density matrix lo of the fictitious ancilla. As 
any density matrix is taken to a density matrix by a CP 
map, we may apply (|7ip to give that the output of 
must have the following form: 

Q:{uj) = a + r''\lQr (72) 

where in the second term Qr is a sequence of operators 
whose norm can be bounded, and the r^X^ term which 
governs the size of the deviation from the final fixed point 
a arises as a consequence of equation (j7ip . This equation 
essentially states that the deviation of QJ^i^jj) from a falls 
off as fast as r^\2- Although the explicit form of 8,. 
depends upon the input state, a bound on the norm of 
Or can easily be constructed that is independent of uj. 
This means that limr_,oo = where we define E as 
the (idempotent) channel that discards the input ancilla 
state and creates a copy of a in its place. For finite r we 
may write: 

or ^Y^ + r^X^Q' (73) 

where Q' is now represents operations of bounded norm 
acting on states of the ancilla (we have dropped the po- 
tential r-dependence of 0' to keep notation uncluttered, 
as it is unimportant). 

Our goal in the remainder of this subsection will be to 
apply this deviation estimate to show that equation (|63p 
holds for matrix product systems. This can be done in 
two steps. In the first step we show that for two large 
blocks of length L separated by a distance d (eventually 
L will become the length of the live blocks and d will 
become the spacer distance 51) the reduced state can be 
approximated by a product. The second step will use 
the triangle inequality to go from this result to the full 
condition 

The first step proceeds as follows. For convenience we 
will consider a chain of total chain of length 2n + 2i + d, 
for which the state of the whole chain can be written: 

tra„c{T"+^+''+^+"(^)}. (74) 

If we take the limit as n — > oo, the reduced state of the 
two large blocks A, B each of length L can be written 

PAB = tranc{Sr^Q''r^S(CT)}, (75) 



and the individual reduced states of each block A, B can 
be written: 

PA = tra„c{Sr^S(a)}, (76) 

and 

PB = tra„c{ST^E(a)}. (77) 

Now from equation (|73p we know that up to a correction 
d^\2& , the channel Q"^ becomes equivalent to S. Hence 
we find that pab and pA ® Pb deviate as follows: 

II PAB -Pa®Pb\\ = d'^Xi II tra„c{ST^O'r^I](cc;)} || 
< constant x d'^Aj (78) 

where the constant is independent of L. Now for our 
situation L is simply the size of each block I, and the 
spacing between the blocks is s = 51. Hence for two live 
blocks separated by one spacer block this bound becomes: 

II PAB — PA'^ Pb \\ < constant x s'^Aj 

< constant X s*^ exp(log(A2)s) 

To go from this result for two live blocks to equation 
([BS]) one simply notes that the above argumentation can 
also be applied to blocks of unequal size, and then the 
triangle inequality applied to sequences sums of a similar 
structure to \\pLiL2L3Li - Pl^l-^l^ (8 PL4II + WpliL-^Ls ® 
PL4 — PL1L2 ® PL3 ® PiW yields equation ([55]) with only a 
polynomial overhead in I. 

Bosonic systems - Here we consider chains of harmonic 
oscillators whose Hamiltonian can be written in the form 

H = pp^/2 + yiVx^/2 (79) 

where h = I and we arrange the canonical conjugate 
position and momentum operators in vector form x = 
{xi,...,x„) and p ~ (pi, . . . ,pn) and introduced the 
so-called potential matrix V [37[. The potential matrix 
encodes the interaction pattern of the harmonic oscil- 
lators in the chain. From now on we assume that V 
is a k-bandcd matrix, i.e. Vij = for \i — i\ > k/2. 
Physically this implies that interaction strength van- 
ish strictly beyond the (fc/2)-th neighbour. An impor- 
tant quantity in this context is the symplectic matrix a 
which is defined by ajk ~ {[Rj^Rk]) where we denote 
R (xi, . . . ,i„,pi, . . . ,p„). 

The ground state of the Hamiltonian eq. ([75)1 is then a 
Gaussian state [ssl . in the sense that its characteristic 

function Xpi^) tr[pWz] where Wz ~ e*^^*^^ is the Weyl 
operator is Gaussian, i.e. 

Xp(z)=Xp(0)e-^-"^"''-+^"^ (80) 

where jj^ = 2Rc[RjRfi,p] and D = atr[Rp]. The density 
operator may then be recovered via 
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For the ground state the first moments D due to the 
reflection symmetry of the Hamiltonian. Therefore the 
ground state is fully characterized by the covariancc ma- 
trix 7, which is defined as 7j_fc = 2Re[RjRkp\ where we 
have explicitly used the fact that the first moments van- 
ish. An explicit computation reveals that the covariance 
matrix of the ground state of Hamiltonian cq. ([75)) is 
given by 7 = V-^/"^ © ^^/^ 

For the following proof of eq. ([63)) we will bound the 
trace norm by the quantum relative entropy using [4lj 

Property 1: For all density operators a,p we have 
S{a\\p) > i(ir|(T-/5|i)2 and S{aAB\\^A®aB) = Si&A® 
ctb) — S{ctab)- 

The entropy of a Gaussian state is determined by the 
symplectic eigenvalues {/ij } of 7 that are siniply the stan- 
dard eigenvalues of the ijcr. We then find [3^ 



N 



Sip) = 



where 



X + 1 X + 1 X — 1 X — 1 



(82) 



(83) 



In the following proof we will need to compute reduced 
density matrices. On the level of covariance matrices 
this is particularly easy as the covariance matrix of a 
sub-system A is obtained simply by removing all entries 
referring to operators in the complement of A. 

Before we proceed to the proof of property eq. ([55)) 
we first derive a useful Lemma that extends Fannes in- 
equality to Gaussian states. Fannes showed [iOl that for 
d— dimensional systems and A = tr\p — a\ < 1/e, we 
find \S{p) - S{&)\ < A log d- A log A. Obviously, in this 
form the theorem cannot be extended to infinite dimen- 
sional continuous variable systems as this would imply 
d ^ 00 which renders the upper bound trivial. Consid- 
ering Gaussian states however it is possible to derive a 
more useful Fannes-type inequality. 

Lemma I (Bosons): Given two N-mode Gaus- 
sian states Pi characterized by covariance matrices 7^ 
with symplectic eigenvalues {pl}j=i^...,N that satisfy 

0.17623008 is the nonzero 
+ k log2 fc = 2, we find 



maxj — /X2I < B, where B f 
solution of (fc + 2) log2(fc + 2) 



N 



\s{M-s{p2)\ < E- 1/^1-/^2 1 iog2l/^i 

< A log2 iV - A log A (84) 
where A = X^^i Im{ - M2l- 

Proof — A Gaussian state is a valid quantum mechanical 
state exactly if it satisfies the uncertainty relations 7 4- 
i(j > 0. This implies pj > 1 for all To bound the 
entropy eq. ([82)) we note that for f{x) as defined in eq. 
we have 



and that Vx > 1 and fc > we find ^[/(a; -I- fc) - f{x) 
klog^k] < 0. Thus we have < f{x + k) - f{x) 



< 



-fclog2fc for all x > 1 and k < B. Inserting this into 



the entropy formula eq. ([5^ we then find the first in- 
equality in Lemma I. The second inequality is obtained 
from the fact that the entropy of any probability distribu- 
tion with N non-zero probabilities is bounded by log2 N. 
This completes the proof. 

It is worthwhile noting that an analogous theorem may 
also be proven for the fermionic case [48|. 

Theorem 1: In an infinite chain of harmonic oscilla- 
tors in its ground state we pick two blocks, each consist- 
ing of L contiguous harmonic oscillators. The two blocks 
are separated from each other by d harmonic oscillators. 
Then we find that 



\pab-pa(E)Pb\\i < C(i)e-"'^ 



(86) 



for some polynomial C{L) and constant a independent 
of d. 

Proof — We will proceed using Lemma I to bound 
the entropy difference S{pab\\pa ® Pb) = S{pA <S) pb) — 
S(pab)- To this end we need to bound the difference in 
symplectic eigenvalues of the covariance matrices corre- 
sponding to Pa® Pb and pab- Property 1 then yields 
the desired result. 

We denote with Aground the ground state of the com- 
plete system and write the covariance matrix of the two 
blocks of harmonic oscillators (both of length L) in the 
(xi , pi , a;2 , _P2 , . . . ) ordering as 



Fa Tab 



(87) 



Given that the potential matrix V is banded we know 
from m, m, [131 that the entries of Aground decrease ex- 
ponentially in the distance d from the main diagonal. 
Therefore, the entries of Tab are exponentially decreas- 
ing with distance from the lower left corner whose entry 
is of the order Cie~°"^. 

We employ Theorem VIII. 3. 9 of which states that 



\X\{A)-X\{B)\\\ < ^ycond{S)cond{T)\\\A-B\ 



for every unitarily invariant norm and where S (T) diag- 
onalize A (B) and cond{S) = ||'S'||-||5'~^|| is the condition 
number. Given that the matrix iTa can be diagonalized 
by a matrix of the form C/F~^/^ we find 

\m)'-i^^'\\\ < (89) 

{cond{T)cond{TA®TB))^/^\\\<j{T~TA®TB)\\\. 

By the pinching inequality for Hermitean matrices 
m C{A) ^ A we find cond(TA © TB),cond{T) < 
cond(^ ground)- ToT the trace norm we then find 



lini [f[x 



fc) - /(x) + fc log2 fc] < for k<B (85) 



\\{p^^)^-{p\)%<2{cond{^g 



.d))^'^\\<yTAB\\i. (90) 
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Then \\Tab\\i < 2L||r^B||2 and ||a||i = 4i yield 

IIK)^ - (M2)^IIi < l6^cond{jground)L^C2e-"'' (91) 

for constants a and C'2 independent of L. Inserting this 
into Lemma I finishes the proof. 

As with matrix product states, apphcation of the tri- 
angular inequality then yields eq. ()63p . 



with F the composition of first applying an analytic ma- 
trix function to a covariance matrix and subsequently 
picking the sub-block describing the reduced state of a 
contiguous block of L harmonic oscillators. Analogously, 
denote with p,, the composition of first applying the r*'' 
matrix power followed by picking a sub-block as before. 



IX. PROOF OF PROPERTY EQ. 1^ FOR 
VARIOUS STATES 

Matrix product or finitely correlated states - We con- 
sider the same states as in section IVIIII and proceed sim- 
ilarly. We begin by computing 

pUah) = tr_{r'Q^W(a)} 

Applying equation (|73p again we can write the powers of 
Q as 

for two bounded operators Oi,02. In this equation in 
order to unclutter the notation we have replaced the first 
Z^(l -f (5) — / with the weaker estimate f , and the second 
one (in the exponent) by the weaker estimate l^ - in fact 
their form is not particularly important for what follows. 
Putting these expressions for the powers of Q into the 
expressions for the states, we find that for sufficiently 
large I: 

\\P\+A(i) - p\hi+S)\\ < constant X Af^'\ (92) 

Picking A(/) = for example, hence allows us to sat- 
isfy all the required conditions. 

Bosonic systems - As for condition eq. (j63p we 
consider the ground state for Hamiltonians that are 
quadratic in the canonical coordinates x and p and k- 
banded potential matrices V. The ground state is then 
given by 7 = F"^/^ ® V^^'^. 

Let us now consider pj^^ := pj^^^;-) with covariance 

matrix 71 and :— pje^j^^-j with covariance matrix 72, 
ie the reduced density matrices of a block of I spins in a 
chain of l+A{l) harmonic oscillators (described by covari- 
ance matrix Fi) and in a chain of l^ (1 + 5) harmonic oscil- 
lators (described by covariance matrix r2) respectively. 
Now we will demonstrate that the covariance matrices 71 
and 72 converge to each other in the limit I ^ 00. In the 
following we will chose, for our convenience, L sufficiently 
large to ensure that I + A{1) < 1^{1 + S). 

Given a k-banded potential matrix V let us chose a 
number r — A{l)/k. Then V' is A(^)-banded. Denote 



Then we conclude Pr{Ti) ~ Pri^i) due to the k- 
bandedness of V. Furthermore, by Bernsteins theorem 
(see [4^ for a short introduction) we then find 



||F(Fi) - ^^(F2)|| < ll^^(Fi) - ^(Fi)ll + |k(F2) - F(F2)|| 
^ 4M(x) 

" x''(x-i)' 



Because X > 1 (see UHl) this tends to zero with A(L) 
00. Choosing F{A) = A^^^ and F{A) = A'^/^ allows 
us to then to conclude that the difference of the covari- 
ance matrices 71 and 72 is bounded by an exponentially 
decreasing function in A(L). 



To continue, we proceed in two steps. First we show 
that the above property implies the weak convergence of 
the two reduced density matrices. Then we use this to 
show that this is already enough to imply the trace norm 
convergence. 

Lemma I: Given two Gaussian states p^^"* and p^^^ above 
with vanishing displacement and covariance matrices 7^^^ 

and 7^-' such that lim^^oo ||7l^'' ~7l^''|| = then for any 
sequence Xl , where | \Xl | | 1 < C*, with finite rank we have 
limi^oo ir[(p(^'^ -pf )Al] =0. 

Proof — Given that the Hamiltonian of the harmonic 
chain is gapped we find that j^^^ > cl for some constant 

c < 1 independent of L. Then chose 117^^^^ ~ 7l^^II ^ ^ < 
c < 1, |1 - e-^l < 2\x\ for a; < 1, |1 - e"^] < e'^l for all 
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X and Xl as above. Wc find 
|ir[(p(i)-p(2))x^]| = 

UlWi 
(27r)« 



< 



< 



( 1 T / (2) (1) \ 



(271-)" y|2|>2e-i/4 



4(2^)" J|.|<2.-i/4 
(27r)" iu|>2.-i/4 



'Nl>2 



(27I')- J|z|<2£-i/4 

(27rj" J|3|>2£-i/4 



where the last line follows from upper bounds on the error 
function. Note that the first term on the right hand side 
is proportional to trpj^ which is bounded by a constant 
independent of L because —\og2 trpj^ < S{pl) and the 
harmonic chain Hamiltonian obeys an entropy-area law 
[S^l- Thus for sufficiently small e the right hand side 
becomes arbitrarily small. This concludes the proof of 
Lemma I. 

Now we need to prove that weak convergence implies 
trace-norm convergence for harmonic chains. The follow- 
ing proof will use in an essential way the fact that the 
ground state of bosonic Hamiltonians that a re q uadratic 
in the canonical operators obey an area law |37l . . 

Lemma II: For the ground state of a bosonic Hamil- 
tonian H that is quadratic in the canonical coordinates 

the limit lim^^oo ^''[(Pl ■* ^ P^l^)^l] — for any sequence 
Xl, with < K, with finite rank already implies 

trace norm convergence limi_,oo HPl"* ~ = 0. 

Proof: Given < e < 1. To begin with we write 



spanned by the subspaces determined by Pi and P2 which 
defines P^. What we need is that km is bounded inde- 
pendent of L. To see this, it is important to note that 
the ground state of H satisfies an area law, i.e. in the 
1-D setting there is a constant C such that S^p^l') < C 
for all L. Let us denote by {A^}fe=o,...,oo the decreasingly 
ordered eigenvalues of p^^\ Note that for all k we have 
Ai < TT by trp''[' = 1. Thus we find 



Ipi^'-pflli < ||pi''-Ppi''P||i + ||Ppi'^P-Ppf Pill 



+ ||Ppi^^P-p^^^||i 



(2) 



(93) 



for some P that is yet to be determined. We now would 
like to establish the existence of a spectral projection P 
of finite rank such that \\p^^ — Pp^^ P\\i < e. In other 
words we aim to project onto the subspace made up of 
the eigenvectors corresponding to the km largest eigen- 

(i) 

values of p}^ . We argue that such a projection Pi^L exists 

(i) 

for each p)^ . Then one may project onto the subspace 



00 

C>~Y1 Ailog2Ai>log(A;„) ^ A 



k=k 



(94) 



k=k„ 



Therefore we find for the choice km > e"^/^ that 
Y^V=k ^k — ^ ^'^^ choice of L. Thus Pl can be cho- 
sen to be a rank km projector. Thus Pl is bounded in the 
trace norm but the subspace onto which it projects will 
generally depend on L. Note further that with the above 

Pl the weak convergence limi_>oo ^'"[(Pl ^ — Pl'')-^l] = 
implies that for sufficiently large L we have that 

WPlPl^Pl - PlP^l Pl\\i < Thus we find that for any 
e > and sufficiently large L we have ||/5^^'' — p^'^Hi < 3e 
thus establishing the required trace norm convergence. 



X. GENERALISATIONS TO OTHER 
INTERACTIONS 

It is natural to ask whether the approach that we have 
adopted can enable progress to be made for unitary inter- 
actions other than controUed-phase gates (or their higher 
dimensional analogues). Some generalisation are imme- 
diate. For instance, given any channels that are proba- 
bilistic applications of unitaries, where the unitaries are 
controlled on different classical or quantum basis states 
of the environment, expression p6p can easily be shown 
to be an explicit lower bound to the regularized coherent 
information. Hence if the environment state has suffi- 
ciently decaying correlations, expression p6p will also be 
a lower bound to channel capacity. In a similar man- 
ner it is likely that any channel whose capacity can be 
bounded by such simple entropic expressions will benefit 
from similar insights. 



XI. DISCUSSION & CONCLUSIONS 

We have considered models of correlated error inspired 
by many-body physics, with the aim of demonstrating 
behaviour in the capacity that parallels similar behaviour 
in the associated many-body systems. In this context 
a number of interesting questions which require further 
investigation. 

The first of these questions regards our initial motiva- 
tion - to find models of correlated error that display in- 
teresting non-analytic behaviour. However, non-analytic 
behaviour in many-body systems arises only in the ther- 
modynamic limit, and so our results unfortunately do not 
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really explain why the non-anal ytic ities that have been 
observed in papers such as [nl, 111, [11 occur for finite 
truncations of the channel. Furthermore, the quest for 
'genuine' non-analyticity is actually open to some debate 
- by redefining the parameters defining the channel, it 
is always possible to remove any non-analytic behaviour. 
However, we hope that our work may help to shed light on 
non-analytic behaviour for physically relevant parameter 
choices such as magnetic fields and inter-particle cou- 
plings (5^ . In realistic models of correlated error it is 
such forms of parametrization that will probably be most 
important. 

It will also be interesting to see how far the approach 
adopted here can be extended to other possible system 
environment interactions. The channels that we have in- 
vestigated above are all of a very specific kind - as random 
unitary channels, they do not permit quantum informa- 
tion to be transmitted from one system particle to an- 
other via the environment. More general channels with 
memory will have this property, and so it will be interest- 
ing to understand what effects this qualitative difference 
this can make. 

Another open question is whether the conditions (j63[ 
I64p can be established for wider families of many-body 
system. In addition to the systems for which we have 
demonstrated these conditions, recent work by M. Hast- 
ings demonstrates that they hold for the ground 
states of many fermionic systems too. His approach 
raises interesting questions concerning topological invari- 



ants which may have further significance for the problems 
considered in this paper. 

Finally, it is important to note that the connections 
made in [16| and this work are actually quite natural - 
entropies and correlations have a significant role in sta- 
tistical physics, and so quantum channel capacities with 
correlated error should have some connection to many- 
body physics. However, it would be nice to know if there 
is a deeper link, perhaps through a more direct connec- 
tion between coding theory and the physics of physical 
systems such as spin chains. 
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Bernstein's theorem concerns the approximation of func- 
tions by polynomials [i^]. Given the set Pr of polynomials 
of degree r or less with real coefficients. For a continuous 
function F on the interval [—1, 1] the best approximation 
error is defined as 



In addition suppose that F{z) is real for real z. Then 



Er{f) ^mi{\\F -p\\oo -PePr} 



where 



-Piloo 



max \F{x) — p(x) 

-1<X<1 



(95) 



(96) 



Now assume that F is analytic in an ellipse £^ with foci 
— 1 and 1 and with half axes a > 1 and (3 > 0. Then 
X > a -f /3. Then we have 

Theorem (Bernstein) — Let the function F be analytic 
in the interior of £^ with x > 1 ^-nd continuous on . 



[47] 



[49] 
[50] 



[51] 
[52] 



Er{F) < 



■2M{x) 

x'-(x-i) 



where 



M{x) = max \F{z) 



(97) 



(98) 



It is straightforward to adapt the theorem to other inter- 
vals and we will thus apply this theorem for all intervals. 
G. Meinardus, Approximations of Functions: Theory and 
Numerical Methods, Springer- Verlag, New York 1967. 
Indeed we find Lemma II (Fermions): Given two N- 
mode Gaussian states pi characterized by fermionic co- 
variance matrices F; and fermionic symplectic eigenval- 



ues fil that satisfy maxj 



■ M2 



< 0.6 then we find 



\s{pi) - s{p2}\ < 2j2^\i^-i4\^g2\i4^i4\ 

< 2(Alog2iV- AlogA) 



(99) 



■Mil 



where A = Ef=i l^i 
Proof — Remember that the fermionic sym- 
plectic eigenvalues \fij\ < 1 and that the en- 
tropy is given by S{p) = I]^ie(i^j) with 
f{x) = -i±2ilog2 i±2i - i^log2i^. straightforward 
analysis shows that for \x — y\ < 0.5 and < x,y < 1 
we have | - xlog^x + ylog^y] < -\x-y\l0g2 \x - y\ for 
all < x,y < 1. Thus for all x,y £ [0, 1] we find that 
1/(2;) — f{y)\ < 2|2; — yj logj -i^-j^. Inserting this into 
the entropy formula yields the first inequality in Lemma 
II. The second one follows in the same way as that of 
Lemma I. This completes the proof. 



M. Hastings, larXiv:0710.3324] 

The term 'it correlated' is sometimes more appropriate 
as we will also discuss the notion of correlated error in 
channels with a 2 or 3 spatial dimensional structure, such 
as might arise in 'egg box' storage such as optical lattices. 
In such cases 'memory' does not really have a meaning. 
Irreducibility means that given any starting state there 
is a non-zero probability of eventually going through any 
other state. 

In this context it may be important to note that this 
is also an issue in the definition of phase transitions. 
Some definitions of phase transitions avoid this problem 
by not relying explicitly on any parametrization, but in- 
stead by relying on the divergence of correlation functions 
or the non-uniqueness of a 'well defined' thermal state 
[2^ . |3Q] . Such definitions avoid the problems of defining 
non-analyticity, and may well have analogues in corre- 
lated error channels. 



